A minimum classification error approach to pronunciation variation modeling of non-native proper names
نویسندگان
چکیده
In automatic recognition of non-native proper names, it is critical to be able to handle a variety of different pronunciations. Traditionally, this has been solved by including alternative pronunciation variants in the recognition lexicon at the risk of introducing unwanted confusion between different name entries. In this paper we propose a pronunciation variant selection criterion that aims to avoid this risk by basing its decisions on scores which are calculated according to the minimum classification error (MCE) framework. By comparing the error rate before and after a lexicon change, the selection criterion chooses only the candidates that actually decrease the error rate. Selecting pronunciation candidates in this manner substantially reduces both the error rate and the required number of variants per name compared to a probability-based baseline selection method.
منابع مشابه
Pronunciation Modeling and Lexical Adaptation Using Small Training Sets
A method for data-driven lexical adaptation on the basis of a limited number of acoustic training tokens is discussed. The method is closely related to pronunciation modeling techniques. A set of pronunciation variants is generated by forced alignment, followed by a step to select promising pronunciation candidates by using a ranking function. The method has been validated on a database consist...
متن کاملNon-native Pronunciation Modeling in a Command & Control Recognition Task: A Comparison between Acoustic and Lexical Modeling
In order to improve automatic recognition of English commands spoken by non-native speakers, we have modeled non-native pronunciation variation of Dutch, French and Italian. The results of lexical and acoustical modeling appeared to be source language and speaker dependent. Lexical modeling only resulted in a substantial improvement (of 35%) for the French speakers. Acoustic model adaptation ha...
متن کاملDiscriminative Pronunciation Learning Using Phonetic Decoder and Minimum-classification-error Criterion
In this paper, we report our recent research aimed at improving the pronunciation-modeling component of a speech recognition system designed for mobile voice search. Our new discriminative learning technique overcomes the limitation of the traditional ways of introducing alternative pronunciations that often enlarge confusability across different lexical items. Instead, we make use of a phoneti...
متن کاملModeling context and language variation for non-native speech recognition
Non-native speakers often face difficulty in pronouncing like the native speakers. This paper proposes to model pronunciation variation in non-native speaker’s speech using only acoustics models, without the need for the corpus. Variation in term of context and language will be modeled. The combination of both modeling resulted in the reduction of absolute WER as much as 16% and 6% for native V...
متن کاملImproving Pronunciation Accuracy of Proper Names with Language Origin Classes
Pronunciation of proper names that have different and varied language sources is an extremely hard task, even for humans. This thesis presents an attempt to improve automatic pronunciation of proper names by modeling the way humans do it, and tries to eliminate synthesis errors that humans would never make. It does so by taking into account the different language and language family sources and...
متن کامل